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Abstract 

Unlimited access to a motorway network can, in overloaded conditions, 
cause a loss of capacity. Ramp metering (signals on slip roads to control 
access to the motorway) can help avoid this loss of capacity. The design 
of ramp metering strategies has several features in common with the 
design of access control mechanisms in communication networks. 

Inspired by models and rate control mechanisms developed for Inter- 
net congestion control, we propose a Brownian network model as an 
approximate model for a controlled motorway and consider it operating 
under a proportionally fair ramp metering policy. We present an analysis 
of the performance of this model. 

AMS subject classification (MSC2010) 90B15, 90B20, 60K30 



1 Introduction 



The study of heavy traffic in queueing systems began in the 1960s, with 
three pioneering papers by Kingman [26 , [27I. |28| . These papers, and the 
early work of Prohorov [35j [ , Borovkov [g, [6j and Iglehart |20j , concerned 
a single resource. Since then there has been significant interest in net- 
works of resources, with major advances by Harrison and Reiman 19|, 
Reiman [37| . Williams [43] and Bramson For discussions, further ref- 
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erences and overviews of the very extensive literature on heavy traffic 
for networks, Williams 42 1, Bramson and Dai Harrison 17, 18 1 and 
Whitt [41| are recommended. 

Research in this area is motivated in part by the need to understand 
and control the behaviour of communications, manufacturing and ser- 
vice networks, and thus to improve their design and performance. But 
researchers are also attracted by the elegance of some of the mathemat- 
ical constructs: in particular, the multi-dimensional reflecting Brownian 
motions that often arise as limits. 

A question that arises in a wide variety of application areas concerns 
how flows through a network should be controlled, so that the network 
responds sensibly to varying conditions. Road traffic was an area of in- 
terest to early researchers [33| , and more recently the question has been 
studied in work on modelling the Internet. In each of these cases the 
network studied is part of a larger system: for example, drivers gener- 
ate demand and select their routes in ways that are responsive to the 
delays incurred or expected, which depend on the controls implemented 
in the road network. It is important to address such interactions between 
the network and the larger system, and in particular to understand the 
signals, such as delay, provided to the larger system. 

Work on Internet congestion control generally addresses the issue of 
fairness, since there exist situations where a given scheme might max- 
imise network throughput, for example, while denying access to some 
users. In this area it has been possible to integrate ideas of fairness of a 
control scheme with overall system optimization: indeed fairness of the 
control scheme is often the means by which the rig ht information and 



incentives are provided to the larger system [24j, |38[ . 

Might some of these ideas transfer to help our understanding of the 
control of road traffic? In this paper we present a preliminary exploration 
of a particular topic: ramp metering. Unlimited access to a motorway 
network can, in overloaded conditions, cause a loss of capacity. Ramp 
metering (signals on slip roads to control access to the motorway) can 
help avoid this loss of capacity. The problem is one of access control, a 
common issue for communication networks, and in this paper we describe 
a ramp metering policy, proportionally fair metering, inspired by rate 
control mechanisms developed for the Internet. 

The organisation of this paper is as follows. In Section [5] we review 
early heavy traffic results for a single queue. In Section [3] we describe 
a model of Internet congestion control, which we use to illustrate the 
simplifications and insights heavy traffic allows. In Section2]we describe 
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a Brownian network model, which both generalizes a model of Section^ 
and arises as a heavy traffic limit of the networks considered in Section[51 
Sections [3] and 2] are based on the recent results of 2l|, |25[ . These heavy 
traffic models help us to understand the behaviour of networks operating 
under policies for sharing capacity fairly. 

In Section [5] we develop an approach to the design of ramp metering 
flow rates informed by the earlier Sections. For each of three examples, 
we present a Brownian network model operating under a proportionally 
fair metering policy. Our first example is a linear network representing 
a road into a city centre with several entry points; we then discuss a 
tree network, and, in Section [BJ a simple network where drivers have 
routing choices. Within the Brownian network models we show that in 
each case the delay suffered by a driver at an entry point to the net- 
work can be expressed as a sum of dual variables, one for each of the 
resources to be used, and that under their stationary distribution these 
dual variables are independent exponential random variables. For the fi- 
nal example we show that the interaction of proportionally fair metering 
with choices available to arriving traffic has beneficial consequences for 
the performance of the system. 

John Kingman's initial insight, that heavy traffic reveals the essential 
properties of queues, generalises to networks, where heavy traffic allows 
sufficient simplification to make clear the most important consequences 
of resource allocation policies. 



2 A single queue 

In this Section we review heavy traffic results for the M/G/I queue, to 
introduce ideas that will be important later when we look at networks. 

Consider a queue with a single server of unit capacity at which cus- 
tomers arrive as a Poisson process of rate v. Customers bring amounts 
of work for the server which are independent and identically distributed 
with distribution G, and are independent of the arrival process. Assume 
the distribution G has mean X/fi and finite second moment, and that 
the load on the queue, p — vj ' p, satisfies p < 1. 

Let W(t) be the workload in the queue at time t; for a server of unit 
capacity this is the time it would take for the server to empty the queue 
if no more arrivals were to occur after time t. 

Kingman showed that the stationary distribution of (1 — p)W is 
asymptotically exponentially distributed as p — > I. Current approaches 
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to heavy traffic generally proceed via a weaker assumption that the cu- 
mulative arrival process of work satisfies a functional central limit the- 
orem, and use this to show that as p — > 1, the appropriately normalized 
workload process 

W(t) = (l-p) W^-J—V t>0 (2.1) 

can be approximated by a reflecting Brownian motion W on R + . In 
the interior (0,oo) of M + , behaves as a Brownian motion with drift 
— 1 and variance determined by the variance of the cumulative arrival 
process of work. When W hits zero, then the server may become idle; this 
is where delicacy is needed. The stationary distribution of the reflecting 
Brownian motion W is exponential, corresponding to Kingman's early 
result. 

We note an important consequence of the scalings appearing in the 
definition (|2.ip , the snapshot principle. Because of the different scalings 
applied to space and time, the workload is of order (1 — p)^ 1 while 
the workload can change significantly only over time intervals of order 
(1 — p)~ 2 . Hence the time taken to serve the amount of work in the 
queue is asymptotically negligible compared to the time taken for the 



workload to change significantly 3Q, |4l| . 



Note that the workload (W(t),t > 0) does not depend on the queue 
discipline (provided the discipline does not allow idling when there is 
work to be done), although the waiting time for an arriving customer 
certainly does. Kingman [29j makes elegant use of the snapshot principle 
to compare stationary waiting time distributions under a range of queue 
disciplines. 

It will be helpful to develop in detail a simple example. Consider a 
Markov process in continuous time (N(t),t > 0) with state space Z + 
and non-diagonal infinitesimal transition rates 

v if n' = n + 1, 
q(n, n')= \ p. if n' = n — 1 and n > 0, (2.2) 
otherwise. 

Let p — v I p. If p < 1 then the Markov process (N(t),t > 0) has station- 
ary distribution 

P{iV s = n} = (1 - p)p n , n = 0,1,2,... (2.3) 

(here, the superscript s signals that the random variable is associated 
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with the stationary distribution). The Markov process corresponds to 
an M/M/l queue, at which customers arrive as a Poisson process of rate 
v, and where customers bring an amount of work for the server which is 
exponentially distributed with parameter p. 

Next consider an M/G/l queue with the processor-sharing discipline 
(under the processor-sharing discipline, while there are n customers in 
the queue each receives a proportion 1/n of the capacity of the server). 
The process (N(t),t > 0) is no longer Markov, but it nonetheless has 
the same stationary distribution as in (I2.3[) . Moreover in the stationary 
regime, given N s = n, the amounts of work left to be completed on 
each of the n customers in the queue form a collection of n independent 
random variables, each with distribution function 

G*(x) = n ( (1-G(z))dz, x>0, 
Jo 

a distribution recognisable as that of the forward recurrence time in 
a stationary renewal process whose inter-event time distribution is G. 
Thus the stationary distribution of W is just that of the sum of N s inde- 
pendent random variables each with distribution G* , where N s has the 
distribution (|2.3p |2j,[23|- Let S be a random variable with distribution G. 
Then we can deduce that the stationary distribution of (W, N) has the 
property that in probability W s /N s -> E(S 2 )/2E(S), the mean of the 
distribution G* , as p — >• 1. For fixed x, under the stationary distribution 
for the queue, let AJ be the number of customers in the queue with a 
remaining work requirement of not more than x. Then, N^,/N s — > G*(x) 
in probability as p — > 1. At the level of stationary distributions, this is 
an example of a property called state-space collapse: in heavy traffic the 
stochastic behaviour of the system is essentially given by W , with more 
detailed information about the system (in this case, the numbers of cus- 
tomers with various remaining work requirements) not being necessary. 

The amount of work arriving at the queue over a period of time, r, has 
a compound Poisson distribution, with a straightforwardly calculated 
mean and variance of pr and pa 2 r respectively, where a 2 = K(S 2 )/K(S). 
An alternative approach [l5l | is to directly model the cumulative arrival 
process of work as a Brownian motion E = (E(t), t > 0) with matching 
mean and variance parameters: thus 

E(t) = pt + p 1/2 aZ{t), t>0, 

where (Z(t),t > 0) is a standard Brownian motion. Let 

X(t) = E(t) -t, t > 0, 



6 



F. P. Kelly and R. J. Williams 



a Brownian motion starting from the origin with drift —(1 — p) and 
variance pa 2 . In this approach we define the queue's workload W(t) at 
time t by the system of equations 

W{t) = W(0) + X(t) + U(t), t > 0, (2.4) 
U(t) = -m£jt(a), t > 0. (2.5) 

The interpretation of the model is as follows. While W is positive, it 
is driven by the Brownian fluctuations caused by arrival of work less 
the work served. But when W hits zero, the resource may not be fully 
utilized. The process U defined by equation (I2.5[) is continuous and non- 
decreasing, and is the minimal such process that permits W, given by 
equation (|2.4I) . to remain non- negative. We interpret U(t) as the cumu- 
lative unused capacity up to time t. Note that U can increase only at 
times when W is at zero. 

The stationary distribution of W is exponential with mean pa 2 /2(1 — 
p) [ill- This is the same as the distribution of (1 — p) -1 !^ 5 where W s 
has the stationary distribution of the reflecting Brownian motion W 
that approximates the scaled process W given by (12.11) . Furthermore, 
the mean of the stationary distribution of W is the same as the mean 
of the exact stationary distribution of the workload W, calculated from 
its representation as the geometric sum (|2.3[) of independent random 
variables each with distribution G* and hence mean E(5 2 )/2E(5). 

In other words, for the M/G/l queue, we obtain the same exponential 
stationary distribution either by (a) approximating the workload arrival 
process directly by a Brownian motion without any space or time scaling, 
or by (b) approximating the scaled workload process in (|2.ip by a reflect- 
ing Brownian motion, finding the stationary distribution of the latter, 
and then formally unwinding the spatial scaling to obtain a distribution 
in the original spatial units. Furthermore, this exponential distribution 
has the same mean as the exact stationary distribution for the workload 
in the M/G/l queue and provides a rather good approximation, being 
of the same order of accuracy as the exponential approximation of the 
geometric distribution with the same mean. 

The main point of the above discussion is that, in the context of this 
example, we observe that for the purposes of computing approxima- 
tions to the stationary workload, using a direct Brownian model for the 
workload arrival process (by matching mean and variance parameters) 
provides the same results as use of the heavy traffic diffusion approx- 
imation coupled with formal unwinding of the spatial scaling, and the 



Heavy traffic on a controlled motorway 



7 



approximate stationary distribution that this yields compares remark- 
ably well with exact results. We shall give another example of this kind 
of fortuitously good approximation in Section 01 Chen and Yao Q have 
also noted remarkably good results from using such 'strong approxima- 
tions' without any scaling. 



3 A model of Internet congestion 



In this Section we describe a network generalization of processor sharing 
that has been useful in modellin g flo ws through the Internet, and outline 
a recent heavy traffic approach 2lj, |25| to its analysis. 



3.1 Fair sharing in a network 

Consider a network with a finite set J of resources. Let a route i be a 
non-empty subset of J , and write j £ i to indicate that resource j is 
used by route i. Let 1 be the set of possible routes. Assume that both J 
and X are non-empty and finite, and let J and / denote the cardinality 
of the respective sets. Set Aji = 1 if j £ i, and Aji — otherwise. This 
defines a J x I matrix A — (Aji, j £ J , i £ X) of zeroes and ones, the 
resource-route incidence matrix. Assume that A has rank J, so that it 
has full row rank. 

Suppose that resource j has capacity Cj > 0, and that there are rii 
connections using route i. How might the capacities C — {Cj,j £ J) be 
shared over the routes I, given the numbers of connections n — (n, , i £ 
1)1 This is a question which has attracted attention in a variety of fields, 
ranging from game theory, through economics to political philosophy. 
Here we describe a concept of fairness which is a natural extension of 
Nash's bar gain ing solution and, as such, satisfies certain natural axioms 
of fairness 32[ ; the concept has been used extensively in the modelling 
of rate control algorithms in the Internet 0, |38| . 

Let T + (n) = {i £ I : rxj > 0}. A capacity allocation policy A = 
(A(n),n £ where A(n) = (A;(n),i 6 I), is called proportionally 

fair if for each n £ M. 1 ^, A(n) solves 

maximise n^logAj (3-1) 

iel+(n) 

subject to ^AjiAi^Cj, j £ J, (3.2) 

iEl 
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over A, > 0, iel+(n), (3.3) 

A i = 0, iel\l+(n). (3.4) 

Note that the constraint (I3.2[) captures the limited capacity of resource 
j, while constraint (|3.4I) requires that no capacity be allocated to a route 
which has no connections. 

The problem (I3.1I) - (|3.4I) is a straightforward convex optimization 
problem, with optimal solution 

n ■ 

A i {n) = = l — r , iel + K (3.5) 

where the variables q = (qj , j £ J) > are Lagrange multipliers (or 
dual variables) for the constraints (|3.2[) . The solution to the optimiz- 
ation problem is unique and satisfies Aj(n) > for i G X + (n) by the 
strict concavity on (Aj > 0, i g Z+(n)) and boundary behaviour of the 



objective function in (1.6) 25 



The dual variables q are unique if n, > for all i Si, but may not be 
unique otherwise. In any event they satisfy the complementary slackness 
conditions 

9j [c 3 - A ^( n ^j = °> J e ^- ( 3 - 6 ) 



3.2 Connection level model 



The allocation A(n) describes how capacities are shared for a given num- 
ber of connections n» on each route i Si. Next we describe a stochastic 



model 3l| for how the number of connections within the network varies. 

A connection on route i corresponds to continuous transmission of 
a document through the resources used by route i. Transmission is as- 
sumed to occur simultaneously through all the resources used by route 
i. Let the number of connections on route i at time t be denoted by 
Ni(t), and let N(t) = (Ni(t),i £ I). We consider a Markov process in 
continuous time (N(t),t > 0) with state space and non-diagonal 
infinitesimal transition rates 



q(n,n') = < 



vi if n' = n + ei , 

/XjAj(n) if n' — n — ei and n, > 0, (3-7) 
otherwise, 



where is the i-th unit vector in Z{, and !/,-, /i; > 0, i S I. 

The Markov process (N(t),t > 0) corresponds to a model where new 
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connections arrive on route i as a Poisson process of rate and a 
connection on route i transfers a document whose size is exponentially 
distributed with parameter pi. In the case where I = J = 1 and C% = 1, 
the transition rates (I3.7|) reduce to the rates (|2.2p of the M/M/l queue. 



Define the load on route i to be pi — Vij pi for i € X. It is known [J, 1 
that the Markov process is positive recurrent provided 

AjiPi <C h jej. (3.8) 

iei 

These are natural constraints: the load arriving at the network for re- 
source j must be less than the capacity of resource j, for each j € J . 
Let [p] be the I x I diagonal matrix with the entries of p — (pi ,i e I) 
on its diagonal, and define u, [v], p, [p] similarly. 

Each connection on route i brings with it an amount of work for 
resource j which is exponentially distributed with mean 1/pt, for j G i. 
The Markov process N allows us to estimate the workload for each 
resource: define the workload process by 

W(t) = A [p]~ 1 N(t), t>0. (3.9) 



3.3 Heavy traffic 

To approximate the workload in a heavily loaded connection-level model 
by that in a Brownian network model, we view a given connection-level 
model as a member of a sequence of such models approaching the heavy 
traffic limit. More precisely, we consider a sequence of connection-level 
models indexed by r where the network structure, defined by A and C, 
does not vary with r. Each member of the sequence is a stochastic system 
as described in the previous section. We append a superscript of r to 
any process or parameter associated with the r th system that depends 
on r. Thus, we have processes N r , W r , and parameters v r . We suppose 
p r = p for all r, so that p[ = v\ j '/x.;, for each i £ 1. We shall assume 
henceforth that the following heavy traffic condition holds: as r — > oo, 

v r -> v and r (Ap r - C) -)■ -6 (3.10) 

where Vj > and 6j > for all j £ J". Note that (|3.10p implies that 
p r — >• p as r —> co and that Ap = C. 

We define fluid scaled processes N r , W r as follows. For each r and 
t > 0, let 

N r (t) =N r (rt)/r, W r (t) =W r (rt)/r. (3.11) 
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What might be the limit of the sequence {N r } as r — > oo? From the 
transition rates (|3.7p and the observation that A(rn) = A(n) for r > 0, 
we would certainly expect that the limit satisfies 

^- n i {t) = v i - ^Ki{n{t)), iel, (3.12) 
at 

whenever n is differentiable at t and rii (t) > for all iel. Indeed, this 



forms part of the fluid model developed in [25[ as a functional-law-of- 
large-numbers approximation. Extra care is needed in defining the fluid 
model at any time t when ni(t) = 0, for any i G I; the function A(n) 
may not be continuous on the boundary of the region R+, and so when 
any component NJjt) is hitting zero, A(N r (t)) may jitter. 

It is shown in [25] that the set of invariant states for the fluid model 

is 

J\f = < n G : rt,; = pi qjAji,i G 1 for some q G 

as we would expect from formally setting the derivatives in (|3.12j) to 
zero and using relation (|3.5[) . Call AT the invariant manifold. If n G A/", 
then since A has full row rank the representation of n in terms of q is 
unique; furthermore, Ai(n) = pi for i G and then since = C, 

the vector g satisfies equation Q3.5p and the complementary slackness 
conditions (|3.6D . and hence gives dual variables for the optimization 
problem (pU]) - (pT4j) . 

For each n G R+, define w(n) — (vjj(n),j G J\ the workload asso- 
ciated with n, by w(n) = ^4[/i] -1 rt. For each w G R+, define A(u>) to 
be the unique value of n G R+ that solves the following optimization 
problem: 

minimize F(n) 

subject to ^^Aji— > wj, j G J, 



over n, > 0, i G I, 



where 



™i „ ^ to/ 



The function F(n) was introduced in Q and can be used to show positive 
recurrence of N under conditions (|3.8[) . In [25] the difference F(n) — 
F(A(w(n))) is used as a Lyapunov function to show that any fluid model 
solution (n(i),t > 0) converges towards the invariant manifold A/". It is 
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straightforward to check that n G TV if and only if n = A(w(n)) and it 
turns out that 

AH = [p]A , (A\p]~ 1 [v]\ji]~ 1 A')~ 1 w. 

Note that if N r lives in the space Af then W r , given by equations (13.91) 
and (|3.1ip as A[p]~ 1 N r , lives in the space W = A [p]- 1 Af, which we 
can write as 

W = |w G : w = Alp]- 1 [vWfi]- 1 A' q for some q G R^V (3.13) 

generally a space of lower dimension. Call W the workload cone. Let 

W J = {to G R{ : w = Alpl^lvllp]- 1 A' q 

1 r , (3-14) 

for some g G satisfying q<j = f , 

which we refer to as the j th face of the workload cone W. 

We define diffusion scaled processes N r , W r as follows. For each r and 
t > 0, let 

N r (r 2 t) - W r (r 2 t) 

N r (t) = { 1 , W r it) = { ' . 
r r 

In the next sub-section we outline the convergence in distribution of the 
sequence {(N r , W r )} as r — > oo. As preparation, note that if N(t) G Af 
and Ni(t) > for all i el, then Ai(N(t)) — pi for all i G X. Suppose, as 
a thought experiment, that for each i 6 1 the component NT behaves 
as the queue-length process in an independent M/M/l queue, with a 
server of capacity pi . Then a Brownian approximation to NT would have 
variance + fiipi = 2vi. Next observe that if the covariance matrix of 
N r is 2[v] then the covariance matrix of W r = A [p]~ 1 N r is 

r = 2A[p]- 1 [v][ii]- 1 A 1 . (3.15) 



4 A Brownian network model 

Let (A, v, pfj be as in Section [3j thus A is a matrix of zeroes and ones of 
dimension J x I and of full row rank, and v, p are vectors of positive 
entries of dimension /. Let pi = z^/ /ij, i G I, and p = (pi,i G I). Let 
W and W 3 be defined by expressions (|3.13[) and (13.14[) respectively. Let 
9 G M J and T be given by (pTT5l) . 

In the following, all processes are assumed to be defined on a fixed 
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filtered probability space (fi, F, {.Ft}, P) and to be adapted to the filtra- 
tion {Ft}. Let 77 be a probability distribution on W. Define a Brownian 
network model by the following relationships: 

(i) W(t) = W(0) + X{t) + U(t) for all t > 0, 

(ii) W has continuous paths, W(t) E W for all t > 0, and W(0) has 
distribution 77, 

(iii) X is a J-dimensional Brownian motion starting from the origin 
with drift —9 and covariance matrix T such that {X(t) + 9t, Ft, t > 
0} is a martingale under P, 

(iv) for each j e J % Uj is a one-dimensional process such that 

(a) Uj is continuous and non-decreasing, with Uj(0) = 0, 

(b) U,(t) = f* l { ^ WeWJ} dUj(s) for all t > 0. 

The interpretation of the above Brownian network model is as follows. 
In the interior of the workload cone W each of the J resources are fully 
utilized, route i is receiving a capacity allocation pi for each i £ X, 
and the workloads W are driven by the Brownian fluctuations caused 
by arrivals and departures of connections. But when W hits the j th 
face of the workload cone W, resource j may not be fully utilized. The 
cumulative unused capacity Uj at resource j is non-decreasing, and can 
increase only on the j th face of the workload cone W. 

The work of Dai and Williams establishes the existence and 



uniqueness in law of the above diffusion W — (W(t),t > 0). In 21 1 
it is shown that, if 9j > for all j £ J, then W has a unique sta- 
tionary distribution; furthermore, if W s denotes a random variable with 
this stationary distribution, then the components of Q s — 2T~ 1 W S are 
independent and Qj is exponentially distributed with parameter 9j for 
each j e J . 

Now let C be a vector of positive entries of dimension J, define a 
sequence of networks as in Section 1531 and suppose 9 and C are related 



by the heavy traffic condition (I3.10[) . In 2l| it is shown that, subject 
to a certain local traffic condition on the matrix A and suitable conver- 
gence of initial variables (W r (0) , N r (0)) , the pair (W r ,N r ) converges 
in distribution as r — >• 00 to a continuous process (W,N) where W is 
the above diffusion and N = A(W). The proof in [21( relies on both 
the existence and uniqueness results of [l(]| and an associated invariance 
principle developed by Kang and Williams ■ (The local traffic condi- 
tion under which convergence is established requires that the matrix A 
contains amongst its columns the columns of the J x J identity matrix: 
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this corresponds to each resource serving at least one route which uses 
only that resource. The local traffic condition is not needed to show that 
W has the aforementioned stationary distribution; that requires only the 
weaker condition that A have full row rank.) 

It is convenient to define Q = 2T~ 1 W, a process of dual variables. 
From this, the form of A, and the relation N — A(W), it follows that 
N = [p]A'Q. The dimension of the space in which Q lives is J, and so 
this is an example of state-space collapse, with the /-dimensional process 
N living on a J-dimensional manifold where J < I is often considerably 
less than /. 

Using the stationary distribution for W, we see that N s = [p]A'Q s 
has the stationary distribution of N. Then, after formally unwinding the 
spatial scaling used to obtain our Brownian approximation, we obtain 
the following simple approximation for the stationary distribution of the 
number-of-connections process in the original model described in Section 



JV^ft^Q^, iel, (4.1) 

where Qj, j S J ', are independent and Qj is exponentially distributed 
with parameter Cj — Yliex A ji Pi- 

As mentioned in Section^ an alternative approach is to directly model 
the cumulative arrival process of work for each route i as a Brownian 
motion: 

2^ 1/2 



Ei(t)= Pi t+ Zi(t), t>0, 

where (Zi(t),t > 0), i € I, are independent standard Brownian motions; 
here the form of the variance parameter takes account of the fact that 
the document sizes are exponentially distributed. Under this model, the 
potential netfiow (inflow minus potential outflow, ignoring undcrutiliza- 
tion of resources) process of work for resource j is 

Xj(t) = ^2 AjiEi(t) - Cjt, t > 0, 

a J-dimensional Brownian motion starting from the origin with drift 
Ap — C and covariance matrix 2A[p][/i]~ 1 J 4' = T. Then the workload 
is modelled by a J-dimensional process W that satisfies properties (i)- 
(iv) above, but with W in place of W and Ap — C in place of the drift 
—6; the covariance matrix remains the same. By the results of 2l|, if 



Ap < C, there is a unique stationary distribution for the process W 
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such that if W s has this stationary distribution then the components of 
Q s = 2r _1 V^ s are independent and Q| is exponentially distributed with 
parameter Cj — ^2 i£l AjiPi for each j 6 J . The random variable 

N s = [ P ]A'Q S , (4.2) 

has the stationary distribution of N — [p]A'Q, which is the same as the 
distribution of the right member of (I4.1[) . Thus, just as in the simple 
case considered in Section [5J in this connection-level model, using the 
direct Brownian model yields the same approximation for the stationary 
distribution of the number-of-connections process as that obtained using 
the heavy traffic diffusion approximation and formally unwinding the 
spatial scaling in its stationary distribution. 

If we specialize the direct Brownian network model to the case where 
I = J = 1 and (7=1, then we obtain the Brownian model of Section [21 
with T = 2v j p? — pa 2 and where the stationary distribution for W 
is exponentially distributed with mean pa 2 /2(1 — p), yielding the same 
approximation as in Section [2] 

A more interesting example is obtained when I = J + 1 and A is the 
J x ( J + 1) matrix: 



A 



/ 1 ... 1 \ 
1 ... 1 



V ... 1 1 / 



so that J routes each use a single resource in such a way that there 
is exactly one such route for each resource, and one route uses all J 
resources. In this case, the stationary distribution given by (|4.2p ac- 
cords remarkably well with the exact stationary distribution described 
by Massoulie and Roberts [3l| ; it is again of the order of accuracy of the 
exponential approximation of the geometric distribution with the same 



mean. (We refer the interested reader to [2l| for the details of this good 
approximation.) 

In this Section and in Section [2] we have seen intriguing examples of 
remarkably good approximations that the direct Brownian modelling 
approach can yield. Inspired by this, in the next two Sections we explore 
the use of the direct Brownian network model as a representation of 
workload for a controlled motorway. Rigorous justification for use of this 
modelling framework in the motorway context has yet to be investigated. 
See the last section of the paper for further comments on this issue. 



Heavy traffic on a controlled motorway 
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Once motorway traffic exceeds a certain threshold level (measured in 
terms of density — the number of vehicles per mile) both vehicle speed 
and vehicle throughput drop precipitously 13j, |l2j, |39[ . The smooth pat- 
tern of flow that existed at lower densities breaks down, and the driver 
experiences stop-go traffic. Maximum vehicle throughput (measured in 
terms of the number of vehicles per minute) occurs at quite high speeds — 
about 60 miles per hour on Californian freeways and on London's orbital 
motorway, the M25 IH 12, 3^| — while after flow breakdown the average 
speed may drop to 20-30 miles per hour. Particularly problematic is that 
flow breakdown may persist long after the conditions that provoked its 
onset have disappeared. 

Variable speed limits lessen the number and severity of accidents on 
congested roads and are in use, for example, on the south-west quadrant 
of the M25. But variable speed limits do not avoid the loss of throughput 
caused by too high a density of vehicles [l, 14 1. Ramp metering (signals 
on slip roads to control access to the motorway) can limit the de nsity 
of vehicles, and thus can avoid the loss of throughput M & 
But a cost of this is queueing delay on the approaches to the motorway. 
How should ramp metering flow rates be chosen to control these queues, 
and to distribute queueing delay fairly over the various users of the 
motorway? In this Section we introduce a modelling approach to address 
this question, based on several of the simplifications that we have seen 
arise in heavy traffic. 



5.1 A linear network 

Consider the lineaiQ road network illustrated in Figure 15.11 Traffic can 
enter the main carriageway from lines at entry points, and then travels 
from left to right, with all traffic destined for the exit at the right hand 
end (think of this as a model of a road collecting traffic all bound for a 
city). Let M\{t), M 2 (t), ... , Mj(t) taking values in R + be the line sizes@ 
at the entry points at time t, and let C\, C2, . . . , Cj be the respective 
capacities of sections of the road. We assume the road starts at the left 
hand end, with line J feeding an initial section of capacity Cj, and that 

1 We caution the reader that here we use the descriptive term 'linear network' in a 
manner that differs from its use in 12 II . 

2 The term line size is used here to mean a quantity measuring the amount of work 
in the queue, rather than the more restrictive number of jobs that is often 
associated with the term queue size. 
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Figure 5.1 Lines of size mi, m.2, (713, m4 are held on the slip roads 
leading to the main carriageway. Traffic on the main carriageway is 
free-flowing. Access to the main carriageway from the slip roads is 
metered, so that the capacities Ci, C2, C3, C4 of successive sections 
are not overloaded. 



Ci > C2 > . . . > Cj > 0. The corresponding resource-route incidence 
matrix is the square matrix 



/ 1 




.4 



1 



V 



(5.1) 



1 / 



We model the traffic, or work, arriving at line i, i £ I, as follows: let 
Ei{t) be the cumulative inflow to line i over the time interval (0, t], and 
assume (Ei(t), t > 0) is an ergodic process with non- negative, stationary 
increments, with E[i?i(t)] = pit, where pi > 0, and suppose these pro- 
cesses are independent over i El. Suppose the metering rates for lines 
1, 2, . . . , J at time t can be chosen to be any measurable vector- valued 
function A = K(M{t)) satisfying constraints (ET2"1) - (pn)) with n = M(t), 
and such that 



Mi(t) =M i (0)+E i (t) 



Ai(M(s))ds > 0, 



t > 



(5.2) 



for i € I. Observe that we do not take into account travel time along 
the road: motivated by the snapshot principle, we suppose that M(-) 
varies relatively slowly compared with the time taken to travel through 
the system^ 

3 The time taken for a vehicle to travel through the system comprises both the 
queueing time at the entry point and the travel time along the motorway. If the 
motorway is free-flowing, the aim of ramp metering, then the travel time along 
the motorway may be reasonably modelled by a constant not dependent on M(t), 
say Ti from entry point i. A more refined treatment might insist that the rates 
A(Mi(t — Ti),i £ X) satisfy the capacity constraints 1 13.211 . We adopt the simpler 
approach, since we expect that in heavy traffic travel times along the motorway 
will be small compared with the time taken for M{t) to change significantly. 
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How might the rate function A(-) be chosen? We begin by a discus- 
sion of two extreme strategies. First we consider a strategy that pri- 
oritises the upstream entry points. Suppose the metered rate from line 
J, (Aj(M(t)),t > 0), is chosen so that for each t > the cumulative 
outflow from line J, J Q Aj(M(s)) ds, is maximal, subject to the con- 
straint (|5.2[) and Aj(M(t)) < Cj for alH > 0: thus there is equality in 
the latter constraint whenever Mj(t) is positive. For each of j = J — 1, 
J — 2, . . . , 1 in turn define J* Aj(M(s)) ds to be maximal, subject to the 
constraint (|5.2D and 

J 

Aj(M(t))<Cj- A .( M (')). *>0- ( 5 - 3 ) 
i=j+i 

In consequence there is equality in constraint (|5.3[) at time t if Mj (t) > 
0, and by induction for each t > the cumulative flow along link j, 
Jo Ei=j Aj(M(s)) ds, is maximal, for j = J, J — 1, 1. Thus this 
strategy minimizes, for all times t, the sum of the line sizes at time t, 

The above optimality property is compelling if the arrival patterns of 
traffic are exogenously determined. The strategy will, however, concen- 
trate delay upon the flows entering the system at the more downstream 
entry points. This seems intuitively unfair, since these flows use fewer 
of the system's resources, and it may well have perverse and suboptimal 
consequences if it encourages growth in the load pt arriving at the up- 
stream entry points. For example, growth in pj may cause the natural 
constraint (|3.8|l to be violated, even while traffic arriving at line J suffers 
only a small amount of additional delay. 

Next we consider a strategy that prioritises the downstream entry 
points. To present the argument most straightforwardly, let us suppose 
that the cumulative inflow to line i is discrete, i.e., (Ei(t),t > 0) is con- 
stant except at an increasing, countable sequence of times t € (0, oo), for 
each iGl Suppose the inflow from line 1 is chosen to be Ai(M(t)) = C\ 
whenever M±(t) is positive, and zero otherwise. Then link 1 will be fully 
utilized by the inflow from line 1 a proportion p\/C\ of the time. Let 
Aj(M(t)) = Cj whenever both Mj(t) is positive and A,-(M(t)) = for 
i < j, and let Aj(M(t)) = otherwise. This strategy minimizes lexico- 
graphically the vector (Mi(t), M 2 (t) 7 . . . , Mj(t)) at all times t. Provided 
the system is stable, link 1 will be utilized solely by the inflow from line 



18 



F. P. Kelly and R. J. Williams 



j a proportion pj/Cj of the time. Hence the system will be unstable if 

3 = 1 J 

and thus may well be unstable even when the condition (13.8[) is satisfied. 
Essentially the strategy starves the downstream links, preventing them 
from working at their full capacity. Our assumption that the cumulative 
inflow to line i is discrete is not essential for this argument: the stability 
region will be reduced from (|3.8p under fairly general conditions. 

The two extreme strategies we have described each have their own 
interest: the first has a certain optimality property but distributes delay 
unfairly, while the second can destabilise a network even when all the 
natural capacity constraints (|3.8[) are satisfied. 



5.2 Fair sharing of the linear network 

In this sub-section we describe our preferred ramp metering policy for 
the linear network, and our Brownian network model for its performance. 

Given the line sizes M(t) = m, we suppose the metered rates A(m) are 
chosen to be proportionally fair: that is, the capacity allocation policy 
A(-) solves the optimization problem (|3.ip ~ (|3.4p . Hence for the linear 
network we have from relations (I3.5p - (I3.6D that 

Ai(m) = — — — , i G 2+(m), 
where the qj are Lagrange multipliers satisfying 

Qj > o, f c, - A *M j = °> ie J- (5-4) 

Under this policy the total flow along section j will be its capacity Cj 
whenever qj > 0. 

Given line sizes M(t) = m, the ratio mj/Aj(m) is the time it would 
take to process the work currently in line i at the current metered rate 
for line i. Thus 

i 

dj = y^gj, i el, (5.5) 

j=i 

give estimates, based on current line sizes, of queueing delay in each 
of the I lines. Note that these estimates do not take into account any 
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change in the line sizes over the time taken for work to move through 
the line. 

Next we describe our direct Brownian network model for the linear 
network operating under the above policy. We make the assumption that 
the inflow to line i is a Brownian motion Ei — (Ei(t),t > 0) starting 
from the origin with drift pi and variance parameter pier 2 , and so can 
be written in the form 

Ei(t) = pit + pl /2 aZi(t), t > (5.6) 

for j 6l, where (Zi(t), t > 0), i € I, are independent standard Brownian 
motions. For example, if the inflow to each line were a Poisson process, 
then this would be the central limit approximation, with a = 1. More 
general choices of a could arise from either a compound Poisson process, 
or the central limit approximation to a large class of inflow processes. 

Our Brownian network model will be a generalization of the model 
(|2.4p (|2.5p of a single queue, and a specialization of the model of Sec- 
tion 2] to the case where pi — 2 /a 2 , i € X, and the matrix A is of the 
form (pTTj) . 

Let 

X, (t) = J2 AjiEi (i) - Cjt, t > 0; 

note that the first term is the cumulative workload entering the system 
for resource j over the interval (0,f\. Write X(t) — (Xj(t),j E J) and 
X = (X(t),t > 0). Then X is a J-dimensional Brownian motion starting 
from the origin with drift Ap — C and covariance matrix T — a 2 A [p]A'. 
We assume the stability condition (|3.8p is satisfied, so that Ap < C. 
Write 

W = A[p]A'Rl (5.7) 

for the workload cone, and 

W ={A[p]A'q:qeR J + ,q j - 0}, (5.8) 

for the j th face of W. Our Brownian network model for the resource 
level workload AM is then the process W defined by properties (i)-(iv) 
of Sectiong]with W in place of W, C~Ap in place of 6 and V = a 2 A[p]A'. 

The form (|5.ip of the matrix A allows us to rewrite the workload 
cone (I5.7P as 

met : i < *±i, j = 1,2, . . . , J > 

Pj-i Pi J 
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where wj+i = and we interpret the left hand side of the inequality 
as when j = 1. Under this model, at any time t when the workloads 
W(t) are in the interior of the workload cone W, each resource is fully 
utilized. But when W hits the j face of the workload cone W, resource 
j may not be fully utilized. Our model corresponds to the assumption 
that there is no more loss of utilization than is necessary to prevent W 
from leaving W. This assumption is made for our Brownian network 
model by analogy with the results reviewed in Sections [3] and 21 where 
it emerged as a property of the heavy traffic diffusion approximation. 

In a similar manner to that in Section 01 we define a process of 
dual variables: Q — (A[p]A')~ 1 W . Since W is our model for AM, our 
Brownian model for the line sizes is given by 

M = A- X W = [p]A'Q. (5.9) 

Within our Brownian model we represent (nominal) delays D = (Di, i £ 
I) at each line as given by 

F> = A'Q, (5.10) 

since these would be the delays if line sizes remained constant over the 
time taken for a unit of traffic to move through the line, with pi both 
the arrival rate and metered rate at line «Q Relation (|5.10[) becomes, for 
the linear network, 

i 

Di = J2Qj, i = l,2,...,J, 
3=1 

parallelling relation (I5.5[) . Note that when W hits the j th face of the 
workload cone W, then Qj = and Dj-\ = Dj; thus the loss of utiliza- 
tion at resource j when W hits the j th face of the workload cone W is 
just sufficient to prevent the delay at line j becoming smaller than the 
delay at the downstream line j ' — 1. 

If W s has the stationary distribution of W, then the components of 
Q s = (AlpjA') -1 W s are independent and Qj is exponentially distributed 

4 The nominal delay Di(t) for line i at time t will not in general be the realized 
delay (the time taken for the amount of work Mi (t) found in line i at time t to 
be metered from line i). Since Ap < C the metered rate Aj(m) will in general 
differ from pi even when m = A —1 W and W S W. Our definition of nominal 
delay is informed by our earlier heavy traffic results: as Ap approaches C we 
expect scaled realized delay to converge to scaled nominal delay. Metered rates 
do fluctuate as a unit of traffic moves through the line, but we expect less and 
less so as the system moves into heavy traffic. 
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with parameter 

^f C i-I>j> J = 1,2,..., J. 

The stationary distributions of M and D are then given by the distri- 
butions of M s and D s , respectively, where 

i i 

M? = Pi J2Q S v Di = Y,Qj i = l,2,...,J. 

3=1 3=1 

In the above example the matrix A is invertible. As an example of 
a network with a non-invertible A matrix, suppose that in the linear 
network illustrated in Figure 15.11 one section of road is unconstrained, 
say C3 = co. Then, removing the corresponding row from the resource- 
route incidence matrix we have 

/ 1 1 1 1 

A= 111 

V o o o i 

The workload cone is the collapse of W obtained by setting u>2 = u>3, 
and in consequence the construction of W and M = [p]A' (AlplA') -1 W 
enforces the relationship M2/M3 — P2/ Ps- Since the matrix A is not 
invertible, this is no longer a necessary consequence of the network to- 
pology, but is a natural modelling assumption, motivated by the forms 
of state-space collapse we have seen earlier. Essentially lines 2 and 3 use 
the same network resources and face the same queueing delays. 

A Brownian network model of the first strategy from Section lOl could 
also be constructed, but the workload cone and its faces would not be 
of the required form (|5.7p and (|5.8[) . but instead would be defined by 

W = {w 6 R J : < wj < . . . < w 2 < w-l}, (5.11) 

and the requirement that if w £ ~W 3 then wj = Wj+±, with the in- 
terpretation wj+i = 0. Thus face j represents the requirement that the 
workload for resource j comprises at least the workload for resource j + 1 , 
for j = 1, 2, . . . , J. Under this model, resource j is fully utilized except 
when W hits the j th face of the workload cone (|5.11l) : it is not possible 
for W to leave W, since the constraints expressed in the form (|5.11l) 
follow necessarily from the topology of the network embodied in A. The 
model corresponds to the assumption that there is no more loss of util- 
ization than is a necessary consequence of the network topology. Note 
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Figure 5.2 There are six sources of traffic, starting in various lines 
and all destined to eventually traverse section f of the road. Once 
traffic has passed through the queue at its entry point, it does not 
queue again. 



that the proportionally fair policy may fail to fully utilize a resource not 
only when this is a necessary consequence of the network topology, but 
also when this would cause an upstream entry point to obtain more than 
what the policy considers a fair share of a scarce downstream resource. 



5.3 A tree network 

Next consider the tree network illustrated in Figure [521 Access is meter- 
ed at the six entry points so that the capacities C±, C2, ■ ■ ■ , Cq are not 
overloaded. There is no queueing after the entry point, and the capacities 
satisfy the conditions C3 < C2, C5 + Cq < C4, C2 + C4 <C\. 

Given the line sizes M(t) — m, we suppose the metered rates A (to) are 
chosen to be proportionally fair: that is, the capacity allocation policy 
A(-) solves the optimization problem p.l[) - p.4p where for this network 
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We assume, as in the last Section, that the cumulative inflow of work 
to line i is given by equation (|5.6[) for i £ I, where (Zi(t), t > 0), i S I, 
are independent standard Brownian motions. Our Brownian network 
model is again the process W defined by properties (i)-(iv) of Section 0] 
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with W in place of W, C - Ap in place of 6, T = a 2 A[p]A\ and with 
the workload cone and its faces defined by equations (I5.7I) - (|5.8I) for the 
above choice of A. We assume the stability condition (|3.8p is satisfied, 
so that all components of C — Ap are positive. 

If W s denotes a random variable with the stationary distribution of 
W, then the components of Q s = (AlplA')^ 1 ]^ 3 are independent and 
Qj is exponentially distributed with parameter 2a~ 2 (Cj — J2iAjiPi) 
for each j £ J . The Brownian model line sizes and delays are again 
given by equations (|5.9[) and (|5.10[) respectively, each with stationary 
distributions given by a linear combination of independent exponential 
random variables, one for each section of road. 

A key feature of the linear network, and its generalization to tree 
networks, is that all traffic is bound for the same destination. In our 
application to a road network this ensures that all traffic in a line at 
a given entry point is on the same route. If traffic on different routes 
shared a single line it would not be possible to align the delay incurred 
by traffic so precisely with the sum of dual variables for the resources to 
be usedd 



6 Route choices 

Next consider the road network illustrated in Figure 16.11 Three parallel 
roads lead into a fourth road and hence to a common destination. Access 
to each of these roads is metered, so that their respective capacities C\ , 
C2, C3, C4 are not overloaded, and C\ + C2 + C3 < C4. There are four 
sources of traffic with respective loads p±, P2, P3, pi- the first source has 
access to road 1 alone, on its way to road 4; the second source has access 
to both roads 1 and 2; and the third source can access all three of the 
parallel roads. We assume that traffic arriving with access to more than 
one road distributes itself in an attempt to minimize its queueing delay, 
an assumption whose implications we shall explore. 

We could view sources of traffic as arising in different geographical 
regions, with different possibilities for easy access to the motorway net- 
work and with real time information on delays. Or we could imagine a 

5 The tree topology of Figure 15.21 ensures that the queueing delays in the 

proportionally fair Brownian network model are partially ordered. A technical 
consequence is that a wide class of fair capacity allocations, the o-fair 
allocations, share the same workload cone: in the notation of [2l| . the cone W a 
does not depend upon a. 
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Figure 6.1 Three parallel roads lead to a fourth road and hence to 
a common destination. Lines of size mi, mi, frc.3, 7714 are held on the 
slip roads leading to these roads. There are four sources of traffic: 
sources 2 and 3 may choose their first road, with choices as shown. 



priority access discipline where some traffic, for example high occupancy 
vehicles, has a larger set of lines to choose from. 

Given the line sizes M(t) — m, we suppose the metered rates A(m) are 
chosen to be proportionally fair: that is, the capacity allocation policy 
A(-) solves the optimization problem p.l[) - fl3.4[l . For this network 
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and so, from relations (|3.5I) - (|3.6I) . 

, . , m, . . 7774 

Ai(m) = — , 7 = 1,2,3, A 4 (m) = — . 

q% + q± 94 

We assume the ramp metering policy has no knowledge of the rout- 
ing choices available to arriving traffic, but is simply a function of the 
observed line sizes m, the topology matrix A and the capacity vector C . 

How might arriving traffic choose between lines? Well, traffic that 
arrives when the line sizes are m and the metered rates are A(m) might 
reasonably consider the ratios rrii/Ki{m) in order to choose which line 
to join, since these ratios give the time it would take to process the work 
currently in line i at the current metered rate for line 7, for i = 1, 2, 3. 
But these ratios are just qi + qi for 7 = 1,2, 3. Given the choices available 
to the three sources, we would expect exercise of these choices to ensure 
that qi > <72 > 93, or equivalently that the delays through lines 1, 2, 3 
are weakly decreasing. 
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Because traffic from sources 2 and 3 has the ability to make route 
choices, condition (|3.8p is sufficient, but no longer necessary, for stability. 
The stability condition for the network of Figure 16.11 is 

j j 4 

£><X)Ci, 3 = 1,2,3, 5>,<C 4 , (6.1) 

2—1 i— 1 i— 1 

and is thus of the form (|3.8[) . but with A and C replaced by A and C 1 
respectively, where 
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The forms A, C capture the concept of four virtual resources of capacities 
Cj, j = 1, 2, 3, 4. Given the line sizes m = (roj,i = 1,2,3,4), the 
workloads w = (wi,i — 1,2,3,4) for the four virtual resources are w = 
Am. 

For i = 1, 2, 3, 4, we model the cumulative inflow of work from 
source i over the interval (0, t] as a Brownian motion Ei = (Ei(t),t > 0) 
starting from the origin with drift pi and variance parameter piO 1 that 
can be written in the form (|5.6I) . where (Zi(t),t > 0), i = 1, 2, 3, 4, 
are independent standard Brownian motions. Let _E = , i = 1, 2, 3, 4) 
and let X = (X, , j S ^7) be defined by 

= AS(t) - Ct, i > 0. 

Then X is a four-dimensional Brownian motion starting from the origin 
with drift Ap — C and covariance matrix T = a 2 A [p]A'. We assume the 
stability condition (|6.1|) is satisfied, so that all components of the drift 
are strictly negative. Let W, W" be defined by (|5.7p . (|5.8|) respectively, 
with A replaced by A. 

Our Brownian network model for AM is then the process W defined 
by properties (i)-(iv) of Section 2] with W in place of W and C — Ap in 
place of 8. 

Define a process of dual variables for the virtual resources: Q — 
(A[/9]A') _1 iy. Since W is our model for AM, our Brownian model for 
the line sizes is given by 



M = A~ 1 W= [p]A'Q. 
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Our Brownian model for the delays at each line is given by 

D = A'Q, 
which from the form of A becomes 

4 

Di = J2Qj, i = 1,2,3,4. 

Thus at any time t when Qi{t) > 0, Q 2 {t) > then L>i(t) > D 2 (t) > 
Ds(t), and the incentives for arriving traffic are such that traffic from 
source i joins line i. However if Qi(t) = 0, and so L>i(t) = D 2 (t), then 
arriving traffic from stream 2 may choose to enter line 1, and thus con- 
tribute to increments of the workload for virtual resource 1, whilst still 
contributing to the workload for virtual resources 2, 3 and 4. Our model 
corresponds to the assumption that no more traffic does this than is 
necessary to keep Qi non-negative, or equivalently to keep D\ > D 2 . 
Similarly if Q 2 (t) = then D 2 {t) = D 3 (t), and arriving traffic from 
stream 3 may choose to enter line 2, and thus contribute to increments 
of the workload for virtual resource 2, whilst still contributing to the 
workload for virtual resources 3 and 4; we suppose just sufficient traffic 
does this to keep Q 2 non- negative, or equivalently to keep D 2 > D^. 
Finally if Q^{t) = or Q±{t) = then (real) resource 3 or 4 respect- 
ively may not be fully utilized, as in earlier examples, and our model 
corresponds to the assumption that there is no more loss of utilization 
at (real) resources 3 and 4 than is necessary to prevent W from leaving 
W. 

If W s is a random variable with the stationary distribution of W, then 
the components of Q s = (A[/?]A') _1 M^ S are independent and for j = 1, 
2, 3, 4, Qj is exponentially distributed with parameter (j where 



Ci = 


Jj(Ci-pi), 








4(^1+^2- 


Pi 


- P2), 




^(C!+C 2 4 
<J Z 


c 3 


~ Pi ~ Pi ~ Pi), 


C4 = 




P2 - 


- P3 - Pi)- 



Under the Brownian network model, the stationary distribution for line 
sizes and for delays at each line are given by the distributions of M s and 
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IK Y, $3 * = 1.2,3,4. 



4 



The Brownian network model thus corresponds to natural assump- 
tions about how arriving traffic from different sources would choose their 
routes. The results on the stationary distribution for the network are in- 
triguing. The ramp metering policy has no knowledge of the routing 
choices available to arriving traffic, and hence of the enlarged stability 
region (|6.ip . Nevertheless, under the Brownian model, the interaction of 
the ramp metering policy with the routing choices available to arriving 
traffic has a performance described in terms of dual random variables, 
one for each of the virtual resources of the enlarged stability region; when 
a driver makes a route choice, the delay facing a driver on a route is a 
sum of dual random variables, one for each of the virtual resources used 
by that route; and under their stationary distribution, the dual random 
variables arc independent and exponentially distributed. 



The design of ramp metering strategies cannot assume that arriving 
traffic flows are exogenous, since in general drivers' behaviour will be 
responsive to the delays incurred or expected. In this paper we have 
presented a preliminary exploration of an approach to the design of ramp 
metering flow rates informed by earlier work on Internet congestion con- 
trol. A feature of this approach is that it may prove possible to integrate 
ideas of fairness of a control policy with overall system optimization. 

There remain many areas for further investigation. In particular, we 
have seen intriguing examples, in the context of a single queue and of In- 
ternet congestion control, of remarkably good approximations produced 
for the stationary distributions of queue length and workload by use of 
the direct Brownian modelling approach. Furthermore, in the context 
of a controlled motorway, where a detailed model for arriving traffic is 
not easily available, use of a direct Brownian model has enabled us to 
develop an approach to the design and performance of ramp metering 
and in the context of that model to obtain insights into the interaction 
of ramp metering with route choices. Nevertheless, we expect that the 
use of direct Brownian network models will not always produce good 
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results. Indeed, it is possible that such models may be suitable only 
when the scaled workload process can be approximated in heavy traffic 
by a reflecting Brownian motion that has a product-form stationary dis- 
tribution. We believe that understanding when the direct method is a 
good modelling approach and when it is not, and obtaining a rigorous 
understanding of the reasons for this, is an interesting topic worthy of 
further research. 
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